Human Object Interaction Detection


Human-object interaction (HOI) detection is a task of identifying a set of interactions in an image, which involves the localization of the subject (i.e., humans) and target (i.e., objects) of interaction, and the classification of the interaction labels.

Dynamic Scene Understanding from Vision-Language Representations

Add code
Jan 20, 2025
Viaarxiv icon

Soft Vision-Based Tactile-Enabled SixthFinger: Advancing Daily Objects Manipulation for Stroke Survivors

Add code
Jan 12, 2025
Viaarxiv icon

A Multimodal Dataset for Enhancing Industrial Task Monitoring and Engagement Prediction

Add code
Jan 10, 2025
Viaarxiv icon

NMM-HRI: Natural Multi-modal Human-Robot Interaction with Voice and Deictic Posture via Large Language Model

Add code
Jan 01, 2025
Viaarxiv icon

Interacted Object Grounding in Spatio-Temporal Human-Object Interactions

Add code
Dec 27, 2024
Viaarxiv icon

Orchestrating the Symphony of Prompt Distribution Learning for Human-Object Interaction Detection

Add code
Dec 11, 2024
Viaarxiv icon

ContextHOI: Spatial Context Learning for Human-Object Interaction Detection

Add code
Dec 12, 2024
Viaarxiv icon

Precision-Enhanced Human-Object Contact Detection via Depth-Aware Perspective Interaction and Object Texture Restoration

Add code
Dec 13, 2024
Viaarxiv icon

Autonomous Navigation in Dynamic Human Environments with an Embedded 2D LiDAR-based Person Tracker

Add code
Dec 19, 2024
Figure 1 for Autonomous Navigation in Dynamic Human Environments with an Embedded 2D LiDAR-based Person Tracker
Figure 2 for Autonomous Navigation in Dynamic Human Environments with an Embedded 2D LiDAR-based Person Tracker
Figure 3 for Autonomous Navigation in Dynamic Human Environments with an Embedded 2D LiDAR-based Person Tracker
Figure 4 for Autonomous Navigation in Dynamic Human Environments with an Embedded 2D LiDAR-based Person Tracker
Viaarxiv icon

Grasp What You Want: Embodied Dexterous Grasping System Driven by Your Voice

Add code
Dec 14, 2024
Viaarxiv icon